Datasets:
12ml/e-CARE acronym_identification ade_corpus_v2 ag_news ai2_arc alisawuffles/WANLI amazon_polarity amazon_reviews_multi americas_nli amydeng2000/CREAK AndyChiang/cloth AndyChiang/dgen anli Anthropic/hh-rlhf Anthropic/model-written-evals app_reviews aps/dynahate aqua_rat art banking77 bigbench blimp blog_authorship_corpus cardiffnlp/tweet_sentiment_multilingual circa civil_comments codah ColumbiaNLP/FLUTE commonsense_qa conll2003 copenlu/scientific-exaggeration-detection corypaik/prost cosmos_qa cos_e Dahoas/synthetic-instruct-gptj-pairwise dbpedia_14 declare-lab/cicero definite_pronoun_resolution demelin/moral_stories
demo-org/auditor_review discovery dream dynabench/dynasent emo Ericwang/promptCoherence Ericwang/promptGrammar Ericwang/promptNLI Ericwang/promptProficiency Ericwang/promptSentiment Ericwang/promptSpoke ethos exams financial_phrasebank GBaker/MedQA-USMLE-4-options-hf glue google_wellformed_query go_emotions hans has_part hate_speech18 hate_speech_offensive head_qa health_fact hellaswag hlgd hope_edi humicroedit IlyaGusev/headline_cause imdb inverse-scaling/NeQA inverse-scaling/quote-repetition inverse-scaling/redefine-math Jiangjie/ekar_english jnlpba joey234/nan-nli juletxara/xstory_cloze lasha-nlp/CONDAQA launch/open_question_type lex_glue liar lucasmccabe/logiqa martn-nguyen/contrast_nli math_qa maximedb/twentyquestions mc_taco medical_questions_pairs medmcqa metaeval/acceptability-prediction metaeval/ambient metaeval/arct metaeval/autotnli metaeval/avicenna metaeval/babi_nli metaeval/boolq-natural-perturbations metaeval/chaos-mnli-ambiguity metaeval/clcd-english metaeval/cnli metaeval/commonsense_qa_2.0 metaeval/counterfactually-augmented-imdb metaeval/counterfactually-augmented-snli metaeval/crowdflower metaeval/cycic_classification metaeval/cycic_multiplechoice metaeval/dadc-limit-nli metaeval/defeasible-nli metaeval/equate metaeval/ethics metaeval/folio metaeval/help-nli metaeval/implicatures metaeval/implicit-hate-stg1 metaeval/imppres metaeval/lingnli metaeval/linguisticprobing metaeval/logiqa-2.0-nli metaeval/lonli metaeval/monotonicity-entailment metaeval/mutual metaeval/natural-language-satisfiability metaeval/naturallogic metaeval/nli-veridicality-transitivity metaeval/path-naturalness-prediction metaeval/puzzte metaeval/race-c metaeval/recast metaeval/reclor metaeval/ScienceQA_text_only metaeval/scinli metaeval/scruples metaeval/spartqa-mchoice metaeval/spartqa-yn metaeval/strategy-qa metaeval/sts-companion metaeval/syntactic-augmentation-nli metaeval/temporal-nli metaeval/tomi-nli metaeval/universal-joy metaeval/utilitarianism metaeval/wouldyourather
metaeval/x-fact metaeval/xnli miam MoritzLaurer/multilingual-NLI-26lang-2mil7 mteb/amazon_counterfactual mwong/fever-evidence-related ncbi_disease nedjmaou/MLMA_hate_speech nightingal3/fig-qa nlpaueb/finer-139 numer_sense offenseval_dravidian onestop_qa openai/summarize_from_feedback openai/webgpt_comparisons OpenAssistant/oasst1 openbookqa pacovaldez/stackoverflow-questions papluca/language-identification pasinit/xlwic paws paws-x PiC/phrase_similarity pietrolesci/add_one_rte pietrolesci/breaking_nli pietrolesci/conj_nli pietrolesci/dialogue_nli pietrolesci/dnc pietrolesci/fracas pietrolesci/gen_debiased_nli pietrolesci/glue_diagnostics pietrolesci/gpt3_nli pietrolesci/joci pietrolesci/mpe pietrolesci/nli_fever pietrolesci/recast_white pietrolesci/robust_nli pietrolesci/robust_nli_is_sd pietrolesci/robust_nli_li_ts piqa pkavumba/balanced-copa poem_sentiment PolyAI/banking77 pragmeval prajjwal1/discosense qasc qbao775/PARARULE-Plus quail quarel quartz quora race raquiba/Sarcasm_News_Headline relbert/lexical_relation_classification riddle_sense rotten_tomatoes RuyuanWan/Dilemmas_Disagreement RuyuanWan/Dynasent_Disagreement RuyuanWan/Politeness_Disagreement RuyuanWan/SBIC_Disagreement RuyuanWan/SChem_Disagreement scicite sciq scitail sem_eval_2010_task_8 SetFit/insincere-questions SetFit/toxic_conversations sick sileod/attempto-nli sileod/mindgames sileod/probability_words_nli sileod/wikimedqa silicone sms_spam snips_built_in_intents snli social_i_qa species_800 SpeedOfMagic/ontonotes_english stanfordnlp/SHP strombergnlp/offenseval_2020 strombergnlp/rumoureval_2019 strombergnlp/x-stance stsb_multi_mt super_glue swag tals/vitaminc tasksource/babi_nli tasksource/bigbench tasksource/crowdflower tasksource/I2D2 tasksource/mmlu tasksource/oasst1_dense_flat tasksource/oasst1_pairwise_rlhf_reward tasksource/zero-shot-label-nli trec truthful_qa turingbench/TuringBench tweets_hate_speech_detection tweet_eval tyqiangz/multilingual-sentiments universal_dependencies wiki_hop wiki_qa wili_2018
winogrande wiqa wnut_17 xcopa xcsr xglue xnli YaHi/EffectiveFeedbackStudentWriting yahoo_answers_topics yelp_review_full ylacombe/xsum_factuality zapsdcn/citation_intent zapsdcn/hyperpartisan_news zapsdcn/sciie
Other:
deberta deberta-v2 deberta-v3 deberta-v3-base deberta-v3-large discourse discourse-connective discourse-connective-prediction discourse-marker discourse-marker-prediction discourse-relation-prediction extreme-mtl extreme-multi-task mdeberta-v3-base Eval Results multi-task multitask natural-language-inference nli pipeline pragmatics rlhf roberta tasksource Text Classification zero-shot
License:
apache-2.0